Show HN: Vision-Based, Vectorless RAG for Long Douments
github.comΒ·3hΒ·
Discuss: Hacker News
πŸ€–Advanced OCR
Flag this post
A minimalist web app for extracting text from PDFs
deepocr.ccΒ·5hΒ·
Discuss: Hacker News
πŸ“„OCR
Flag this post
Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts
arxiv.orgΒ·2d
πŸ”€Character Classification
Flag this post
Working with Digital Archives: The William S. Burroughs Project
fsuspecialcollections.wordpress.comΒ·2d
πŸ“œDocument Paleography
Flag this post
DeepSeek-OCR + LLama4 + RAG Just Revolutionized Agent OCR Forever
dev.toΒ·2dΒ·
Discuss: DEV
πŸ€–Advanced OCR
Flag this post
English text readability can be estimated using basic linguistic features, study indicates
phys.orgΒ·6h
🧠Intelligence Compression
Flag this post
From Connectivity to Capability: Rethinking the Digital Divide
internetsociety.orgΒ·5h
🌍Cultural Computing
Flag this post
Show HN: Font Finder – Find and Copy Fonts from Any Webpage
font-finder.orgΒ·19hΒ·
Discuss: Hacker News
πŸ”€Font Archaeology
Flag this post
How unstructured data turns your business into a junk drawer - and how to fix it
techradar.comΒ·4h
πŸ“„Document Digitization
Flag this post
The Man Who Invented AGI
wired.comΒ·2hΒ·
Discuss: r/technews
🏴󠁧󠁒󠁳󠁣󠁴󠁿Scottish Computing
Flag this post
Everything About Transformers
krupadave.comΒ·1d
πŸ“Text Parsing
Flag this post
One Trillion Web Pages Archived: Internet Archive Celebrates a Civilization-Scale Milestone
blog.archive.orgΒ·8h
🌐Web Archiving
Flag this post
Literary Theory for Robots by Dennis Yi Tenen
dennistenen.comΒ·3hΒ·
Discuss: Hacker News
πŸ›Digital humanities
Flag this post
Word and PowerPoint Alt Text Roundup
webaim.orgΒ·57m
πŸ“„PostScript
Flag this post
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
hotorslop.comΒ·17hΒ·
Discuss: Hacker News
πŸ“ŠLearned Metrics
Flag this post
Building a Visual Diff System for AI Edits (Like Git Blame for LLM Changes)
news.ycombinator.comΒ·26mΒ·
Discuss: Hacker News
🎯Gradual Typing
Flag this post
Accessibility of STEM documents - talk at PDF days 2025 in Berlin
latex-project.orgΒ·1d
πŸ“„PDF Internals
Flag this post
AI Experiments: Fast Inference with Groq and Third-Party Tools with Kimi K2 in TypingMind
macstories.netΒ·1d
πŸŒ€Brotli Dictionary
Flag this post
Scripts That Don’t Fit: The Hidden Bias of NLP in South Asian Languages
digitalorientalist.comΒ·3d
πŸ›Digital humanities
Flag this post
AI Powered Search Optimization: The Strategic Evolution of Digital Intelligence
future.forem.comΒ·7hΒ·
Discuss: DEV
πŸ€–AI Curation
Flag this post